ConceptNet: A Practical Commonsense Reasoning Toolkit

نویسنده

  • Hugo Liu
چکیده

ConceptNet is a freely available commonsense knowledgebase and natural-language-processing toolkit which supports many practical textual-reasoning tasks over real-world documents including topic-jisting (e.g. a news article containing the concepts, “gun,” “convenience store,” “demand money” and “make getaway” might suggest the topics “robbery” and “crime”), affect-sensing (e.g. this email is sad and angry), analogy-making (e.g. “scissors,” “razor,” “nail clipper,” and “sword” are perhaps like a “knife” because they are all “sharp,” and can be used to “cut something”), and other contextoriented inferences. The knowledgebase is a semantic network presently consisting of over 1.6 million assertions of commonsense knowledge encompassing the spatial, physical, social, temporal, and psychological aspects of everyday life. Whereas similar large-scale semantic knowledgebases like Cyc and WordNet are carefully handcrafted, ConceptNet is generated automatically from the 700,000 sentences of the Open Mind Common Sense Project – a World Wide Web based collaboration with over 14,000 authors. ConceptNet is a unique resource in that it captures a wide range of commonsense concepts and relations, such as those found in the Cyc knowledgebase, yet this knowledge is structured not as a complex and intricate logical framework, but rather as a simple, easy-to-use semantic network, like WordNet. While ConceptNet still supports many of the same applications as WordNet, such as query expansion and determining semantic similarity, its focus on concepts-rather-than-words, its more diverse relational ontology, and its emphasis on informal conceptual-connectedness over formal linguistic-rigor allow it to go beyond WordNet to make practical, context-oriented, commonsense inferences over real-world texts. In this paper, we first give an overview of the role that commonsense knowledge plays in making sense of text, and we situate our commonsense toolkit, ConceptNet, in the literature of large-scale semantic knowledgebases; we then discuss how ConceptNet was built and how it is structured; third, we present the ConceptNet natural-language-processing engine and describe the various practical reasoning tasks that it supports; fourth, we delve into a more detailed quantitative and qualitative analysis of ConceptNet; fifth, we review the gamut of real-world applications which researchers have built using the ConceptNet toolkit; we conclude by reflecting back on the Big Picture. INTRODUCTION In today’s digital age, text is the primary medium of representing and transmitting information, as evidenced by the pervasiveness of emails, instant messages, documents, weblogs, news articles, homepages, and printed materials. Our lives are now saturated with textual information, and there is an increasing urgency to develop technology to help us manage and make sense of the resulting information overload. While keyword-based and statistical approaches have enjoyed some success in assisting information retrieval, data mining, and natural language processing (NLP) systems, there is a growing recognition that such approaches deliver too shallow an understanding. To continue to make progress in textual-information management, vast amounts of semantic knowledge are needed to give our software the capacity for deeper and more meaningful understanding of text. What is Commonsense Knowledge? Of the different sorts of semantic knowledge that are researched, arguably the most general and widely applicable kind is knowledge about the everyday world that is possessed by all people – what is widely called ‘commonsense knowledge’. While to the average person the term “common sense” is regarded as synonymous with “good judgment,” to the AI community it is used in a technical sense to refer to the millions of basic facts and understandings possessed by most people. A lemon is sour. To open a door, you must usually first turn the doorknob. If you forget someone’s birthday, they may be unhappy with you. Commonsense knowledge, thusly defined, spans a huge portion of human experience, encompassing knowledge about the spatial, physical, social, temporal, and psychological aspects of typical everyday life. Because it is assumed that every person possesses common sense, such knowledge is typically omitted from social communications, such as text. A full understanding of any text then, requires a surprising amount of common sense, which currently only people possess. It is our purpose to find ways to provide such common sense to machines.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ConceptNet — a practical commonsense reasoning tool - kit

ConceptNet is a freely available commonsense knowledge base and natural-language-processing tool-kit which supports many practical textual-reasoning tasks over real-world documents including topic-gisting, analogy-making, and other context oriented inferences. The knowledge base is a semantic network presently consisting of over 1.6 million assertions of commonsense knowledge encompassing the s...

متن کامل

Commonsense Reasoning in and Over Natural Language

ConceptNet is a very large semantic network of commonsense knowledge suitable for making various kinds of practical inferences over text. ConceptNet captures a wide range of commonsense concepts and relations like those in Cyc, while its simple semantic network structure lends it an ease-of-use comparable to WordNet. To meet the dual challenge of having to encode complex higher-order concepts, ...

متن کامل

Ethnomethodology and Conversational Analysis

In a speech community, people utilize their communicative competence which they have acquired from their society as part of their distinctive sociolinguistic identity. They negotiate and share meanings, because they have commonsense knowledge about the world, and have universal practical reasoning. Their commonsense knowledge is embodied in their language. Thus, not only does social life depend...

متن کامل

Commonsense knowledge bases and network analysis

We explore the ConceptNet 4 common-sense knowledge base using the tools of network analysis. This exploration is interesting in its own right, but a primary goal is developing methods that are potentially useful for improving the performance of the knowledge base on various common-sense reasoning tasks. ConceptNet has been used by its creators and others for various tasks, and ConceptNet 4 has ...

متن کامل

How a General-Purpose Commonsense Ontology can Improve Performance of Learning-Based Image Retrieval

The knowledge representation community has built general-purpose ontologies which contain large amounts of commonsense knowledge over relevant aspects of the world, including useful visual information, e.g.: “a ball is used by a football player”, “a tennis player is located at a tennis court”. Current state-of-the-art approaches for visual recognition do not exploit these rule-based knowledge s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004